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Abstract —Digital maps have become a part of our daily life 
with a number of commercial and free map services. These 
services have still a huge potential for enhancement with rich 
semantic information to support a large class of mapping appli¬ 
cations. In this paper, we present Map++, a system that leverages 
standard cell-phone sensors in a crowdsensing approach to auto¬ 
matically enrich digital maps with different road semantics like 
tunnels, bumps, bridges, footbridges, crosswalks, road capacity, 
among others. Our analysis shows that cell-phones sensors with 
humans in vehicles or walking get affected by the different road 
features, which can be mined to extend the features of both 
free and commercial mapping services. We present the design 
and implementation of Map++ and evaluate it in a large city. 
Our evaluation shows that we can detect the different semantics 
accurately with at most 3% false positive rate and 6% false 
negative rate for both vehicle and pedestrian-based features. 
Moreover, we show that Map++ has a small energy footprint on 
the cell-phones, highlighting its promise as a ubiquitous digital 
maps enriching service. 

I. Introduction 

Recently, digital maps have gained great attention due to 
their high economic and social impact; They are integrated 
into our everyday lives in different forms such as navigation 
systems, traffic estimation services, location based services, 
asset tracking applications, and many more. Realizing the 
economic value of this technology, several giant companies are 
producing commercial map services including Google Maps 
|[^, Yahoo! Maps |T0| , and Microsoft’s Bing Maps Q, as 
well as free services such as OpenStreetMaps ||^. These 
map services attract millions of users daily. In 2013, Google 
announced that its Google Maps service is accessed by over 
one billion users every month 0 . 

Typically, these maps are constructed through satellite im¬ 
ages, road surveyors, and/or manual entry by trained personnel 
0- However, with the dynamic changes and richness of the 
physical world, it is hard to keep these digital maps up- 
to-date and capture all the physical world road semantics. 
To address this issue, commercial map companies started to 
provide tools, e.g. Google’s MapMaker 0 and Nokia’s HERE 
Map Creator for users to manually send feedback about 
their maps, i.e. crowdsource the map updates. This was even 
generalized to build entire completely-free editable maps such 
as OpenStreetMap (OSM) |[8| and WikiMapia 0. However, 
these services require active user participation and are subject 
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to intentional incorrect data entry by malicious users. 

With the proliferation of today’s sensor-rich mobile de¬ 
vices, cell phones are becoming the bridge between the physi¬ 
cal and digital worlds. Researchers leveraged the GPS chips on 
smart phones to collect traces that can be used automatically 
to update existing maps and infer new roads (g, ig, 1^. 
However, GPS is an energy hungry device and these systems 
focus only on estimating missing road segments. In summary, 
all existing mapping services, both commercial and free, miss 
a large number of semantic features (Eigure that are a 
necessity for many of today’s map-based applications. Eor 
example, navigation systems relay on important semantics to 
better guide users to their destinations; a short route may be 
falsely tempting if traffic lights are hidden from the user, a 
pedestrian tourist might be deceived when finding out that 
the road has no sidewalks, city evacuation planning might be 
ineffective if maps are not tagged with the number of lanes, 
a driver might be at risk of an accident if his map does not 
show the road bumps ahead, and a person with disability needs 
a map that shows the elevator-enabled subway stations. 

In this paper, we present Map-F-F as a system that leverages 
the ubiquitous sensors available in commodity cell-phones to 
automatically discover new map semantics to enrich digital 
maps. Our system depends only on time- and location-stamped 
inertial sensors measurements, which have a low-energy profile 
for both road semantics estimation and accurate localization, 
removing the need for the energy-hungry GPS. Eor example, 
a phone going inside a tunnel will experience a drop in the 
cellular signal strength. This can be leveraged to detect the 
tunnel location. Map-F-F uses a classifier-based approach based 
on the multi-modal phone sensor traces from inside cars to 
detect tunnels and other road semantics such as bridges, traffic 
calming devices (e.g. bumps, cat-eyes, etc), railway crossings, 
stop signs, and traffic lights; In addition, it uses pedestrians’ 
phone sensor traces to detect map semantics like underpasses 
(pedestrian tunnels), footbridges (pedestrian bridges), cross¬ 
walks, stairs, escalators, and number of lanes. 

We present the Map-F-F system architecture as well as the 
details of its components. Implementation of Map-F-F over dif¬ 
ferent android phones shows that we can detect different map 
features accurately with 3% false positive rate and 6% false 
negative for in-vehicle traces, and 2% false positive rate and 
3% false negative rate for pedestrian traces. In addition, Map-F-F 
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Fig. 1. An example from the city of Alexandria, Egypt showing that both commercial and free mapping services miss a number of semantic features for next 
generation maps. The satellite image is used to show the actual area and is annotated with the full semantics extracted by Map++; The majority of these features 
are missing in all current mapping services {shadowed ellipses). Google Maps even does not identify the bridge and defines it as a normal road. 


can detect the location of the detected features accurately to 
within 2m using as few as 15 samples without using the GPS 
chip. This comes with a low power consumption of 23mW, 
which is 50% less than GPS when run at a 1 minute duty 
cycle. 

In summary, our contributions are three-fold: 


• We present the Map-f-f architecture to automatically 
crowdsense and identify map semantics from available 
sensor readings without inferring any overhead on the 
user and with minimal energy consumption. 

• We provide a framework for extracting the different 
map features from both pedestrian and in-vehicle 
traces. 

• We implement Map-f-i- on Android devices and evalu¬ 
ate its accuracy and energy-efficiency in a typical city. 


The rest of the paper is organized as follows: Section |IJ 
presents the system overview. We give the details of extract¬ 
ing the road semantic information from phone sensors with 
pedestrians and in vehicles in sections [nl| and [Tvl respectively. 
Section |V] provides the implementation and evaluation of 
Map-f-f. Section VI discusses related work. Finally, Section 
I VII| provides concluding remarks and give directions for future 
work. 
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II. System Overview 

Figure shows the Map-f-f system architecture. Map-f-f is 
based on a crowdsensing approach, where cell phones carried 
by users submit their data to the Map-f-f service running in 
the cloud in a way transparent to the user. The data is first 
processed by Map-f-f to reduce the noise. Then, the user mode 
of transportation is classified to separate pedestrian and in- 
vehicle traces from other modes of transportation. Map-F-F has 
two core components: one for extracting map features from 
in-vehicle traces and the other for extracting the map features 
from pedestrian traces. Map-F-F takes a classifier approach 
to determine the different road semantics based on extracted 


Fig. 2. The Map-i-i- system architecture. 

features from the collected sensor traces. We give an overview 
of the system architecture in the following subsections and 
leave the details for the semantics detection to sections |III| and 

lEl 

A. Traces Collection 

The system collects time-stamped and location-stamped 
traces along with sensor measurements. These include avail¬ 
able inertial sensors (such as accelerometer, gyroscope and 




























































magnetometer) as well as cellular network information (asso¬ 
ciated cell tower ID and its received signal strength (RSS), 
plus neighboring cell towers and their associated RSS). These 
sensors have a low cost energy profile and they are already 
running all the time during the standard phone operation to 
maintain cellular connectivity or to detect phone orientation 
changes. Therefore, using them consumes zero extra energy. 

Map-F-F requires accurate location information while con¬ 
suming low power; GPS, GSM, and WiFi fingerprinting lo¬ 
calization techniques fT^ , |[23|-|[26l, p6| fail to provide 
both. Hence, Map-F-F leverages our Dejavu energy-efficient and 
accurate localization system m that can provide accuracy 
better than GPS in urban conditions with much lower energy 
consumption. To achieve this, Dejavu uses a dead-reckoning 
approach based on the low-energy phone inertial sensors. 
However, to reduce the accumulated error in dead-reckoning, 
Dejavu leverages the amble and unique physical and logical 
landmarks in the environments; such as turns, curves, and 
cellular signal anomalies; as error resetting opportunities. 
Dejavu can achieve a median distance error of 8.4m in in-city 
driving conditions and 16.6m in highway driving conditions 
with a 347.4% enhancement in energy-consumption compared 
to the GPS. Therefore, Map-F-F energy efficiency is based on 
Dejavu's energy-efficient localization and using the inertial and 
cellular sensors information for its analysis. We quantify this 
in SectioifVl 

B. Preprocessing 

This module is responsible for preprocessing the raw sensor 
measurements to reduce the effect of (a) phone orientation 
changes and (b) noise and bogus changes, e.g. sudden breaks, 
or small changes in the direction while moving. To handle 
the former, we transform the sensor readings from the mobile 
coordinate system to the world coordinate system leveraging 
the inertial sensors. To address the latter, we apply a low-pass 
filter to the raw sensors data using local weighted regression 
to smooth the data GB- 

C. Transportation Mode Detection 

Map-F-F is designed to detect two main classes of map 
semantics; in-vehicle and pedestrian as well as to filter other 
classes, such as train traces. We start by filtering users inside 
buildings. Different approaches have been proposed in liter¬ 
ature based on the different phone sensors 
Map++ uses the lODetector approach due to its accuracy 
and low-energy profile. 

Similarly, transportation mode detection for outdoor users 
has been thoroughly studied in the literature | [3T| , 1^ , p7| . 
We follow the approach proposed by | [T7| that provides high 
accuracy of differentiation between the different transportation 
modes based on the energy-efficient inertial sensors. The 
technique starts by segmenting the location traces p8| using 
velocity and acceleration upper bounds. Then the following 
features are used to classify each segment: The stopping rate, 
the heading and velocity change rate, the segment length, 
the maximum velocity and acceleration, average velocity. 


and velocity variance. A decision tree classifier is applied to 
identify the transportation mode for each segment. 

Once the mode of transportation is detected, a map-matcher 
is applied to the in-vehicle traces to map the estimated 
locations to the road network to reduce the localization error. 
Similarly, the UPTIME step detection algorithm fT3| that takes 
into account the different users’ profiles and gaits is applied 
to the pedestrian acceleration signal to detect the user steps. 
In both cases, features are extracted from the traces to prepare 
for the road semantic classification step. 

D. Map Semantics Extraction 

There are a large number of road semantic features that can 
be identified based on their unique signature on the different 
phone sensors. Map-F-F uses a tree-based classifier to identify 
the different semantics as detailed in Section (pedestrian- 
based semantics) and Section [rv| (vehicle-based semantics). 

E. Road Semantic Eeatures Location Estimation 

Whenever a road semantic feature is detected by the 
semantic detection modules (in-vehicle or pedestrians), Map-F-F 
needs to determine whether it is a new instance of the road 
feature or not as well as determine its location. 

To do this, Map-F-F applies spatial clustering for each type 
of the extracted semantics. It uses density-based clustering 
algorithms (DBSCAN fTl]). DBSCAN has several advantages 
as the number of clusters is not required before carrying 
out clustering; the detected clusters can be represented in an 
arbitrary shape; and outliers can be detected. The resulting 
clusters represent map features. The location of the newly 
discovered semantics is the weighted mean of the points inside 
their clusters. We weight the different locations based on their 
accuracy reported by Dejavu: In Dejavu, the longer the user 
trace from the last resetting point, the higher the error in 
the trace m- Therefore, shorter traces have better accuracy. 
When a new semantic is discovered, if there is a discovered 
map feature within its neighborhood, we add it to the cluster 
and update its location. Otherwise, a new cluster is created to 
represent the new road feature. To reduce outliers, a semantic 
is not physically added to the map until the cluster size reaches 
a certain threshold. 

E Practical Considerations 

Sensor specifications are different from one phone manu¬ 
facturer to another, which leads to different sensor readings for 
the same map feature. To address this issue, Map-F-F applies 
a number of techniques including use of scale-independent 
features (e.g. peak of acceleration) and combining a number 
of features for detecting the same semantic feature. 

Map-F-F does not also require real-time sensor data col¬ 
lection; it can store the different sensor measurements and 
opportunistically upload them to the cloud for processing; 
allowing it to save both communication energy and cost. This 
is outside the scope of this paper. 



Fig. 3. A decision tree classifier for detecting different map features from pedestrian traces. 


III. Pedestrian Traces Semantic Detection 
Module 

To determine the different road semantics, Map++ applies 
a decision tree classifier to the extracted features from the 
pedestrian traces. Figure shows the decision tree classifier 
used to extract the different semantic map features from the 
pedestrian traces. We give the details of the classifier features 
that can differentiate the different semantic road features 
(underpasses, stairs, escalators, footbridges, crosswalks, and 
number of lanes) in the next subsections. 

A. Underpasses (Pedestrian Tunnels) 

Underpasses or pedestrian tunnels are specially constructed 
for pedestrians beneath a road or railway, allowing them to 
reach the other side. A pedestrian trace crossing a road may 
be a crosswalk (e.g. zebra lines), a bridge, or an underpass. 
We identify the underpasses from other classes by their unique 
features: Walking inside an underpass, a cell-phone will expe¬ 
rience a drop in the cellular signal and also a high variance in 
the magnetic field around it (both Y and X axes) due to metals 
and electricity lines inside the tunnel (Figure]^. 

B. Stairs 

Furthermore, when ascending or descending stairs, the fre¬ 
quency of steps, detected by a simple peak detector (Figure |^, 
within the unit distance increases since the user is moving 
vertically. When descending stairs, the gravity force affecting 
the person will lead to a higher peak in acceleration, and 
hence higher variance, as compared to walking (Figure |^. 
The number of steps can be used, e.g., to determine the height 
of the pedestrian bridge, which is useful for determining the 
height limits for the vehicles on the road. 

C. Escalators 

When using escalators, users typically keep standing while 
carried by the moving staircase. Therefore, the acceleration 
variance remains small compared to walking. However, esca¬ 
lators are often powered by constant-speed alternating current 
motors, which results in high variance in the magnetic field 
(Figure [ 7 ]). 
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Fig. 4. The effect of walking in an underpass on the cellular signal strength 
and the ambient magnetic field. Typically, the cell-phone will experience a 
drop in the cellular signal and also a high variance in the magnetic field in 
both sides of the tunnels. 
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Eig. 5. Step pattern while walking versus the step pattern when going down 
stairs. The stairs are clear from the walking pattern as they have a higher peak 
in comparison to the normal walking peaks. The number of steps is detected 
with a simple peak detector. 
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Fig. 6. The figure compares the effect of descending stairs to the effect 
of walking on the acceleration variance and the steps frequency. Descending 
stairs leads to a higher magnitude in acceleration and higher frequency. 
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Fig. 7. Variance of acceleration and ambient magnetic field while being 
stationary, walking, and using the escalator. 
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Fig. 9. A decision tree classifier for detecting different map features from 
in-vehicle traces. 


E. Crosswalk 

A crosswalk or pedestrian crossing is a designated point 
on a road to assist pedestrians wishing to cross, they could 
be found at intersections and busy roads. Crosswalks could 
be detected from pedestrian traces; a point on the road where 
users cross to another road without footbridges or underpasses. 


F. Number of Lanes 

Road capacity is an important feature for applications like 
traffic estimation and evacuation route planning. We detect the 
number of lanes from the road width divided by the average 
lane width in the region. Road lanes typically have an accepted 
width range set by an authority for every country j^. We detect 
road width from the length of the crosswalk, the footbridge, 
or the underpass that helps pedestrian cross it. Since users can 
have different patterns for crossing the road, we found that 
using the minimum road length leads to an accurate estimate 
for the number of lanes. 

IV. In-Vehicle Traces Semantic Detection 
Module 


Fig. 8. Acceleration variance when going up then walking on the footbridge 
then going down on stairs. 


We extract the different map semantic features from the 
traces collected by the in-vehicle users. Figure shows the 
tree classifier used to detect the different semantics. 


D. Footbridges (Pedestrian bridges) 

Similar to underpasses, footbridges allows pedestrians to 
safely cross roads, railways and rivers. A user crossing a 
footbridge will use stairs/escalators to ascend and descend. In 
between, the user will walk the length of the footbridge (Fig¬ 
ure [^. We separate footbridges from crossroads by detecting 
the stairs/escalator pattern before/after using them; we separate 
them from underpasses using the cellular signal which drops 
in the underpasses case but not in the footbridge case. 


A. Tunnels 

Similar to the underpass case, a car going inside a tunnel 
will typically experience an attenuated cellular signal. We also 
noticed a large variance in the ambient magnetic field in the 
x-direction (perpendicular to the car direction of motion) while 
the car is inside the tunnel. This is different from the underpass 
case, where there is no smooth ramp at the end and hence both 
the X and y magnetic fields are affected. Therefore, car tunnels 
have a low variance in the y-axis (direction of car motion) 
magnetic field. 
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Fig. 10. Effect of the different map features on the Y-axis gravity acceleration. 
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B. Bridges 

Bridges cause the car to go up at the start of the bridge 
and then go down at the end of the bridge. This is reflected on 
the Y-gravity or Z-gravity acceleration (Figure Although 
other classes, such as bumps, cause the same effect (Y or Z 
gravity acceleration going up then down), bridges are unique 
in having this effect over a longer distance GD- The bridge 
is detected at its end. Note that after detecting the end of the 
bridge, we could identify its starting point. 

C. Traffic Calming Features 

Different traffic calming techniques like bumps, speed 
humps, and cat’s eyes all cause the car to move up then 
down similar to bridges, affecting all gravity acceleration axes. 
However, unlike bridges, all these classes affect the gravity 
acceleration over a small distance. To further separate these 
classes, we employ other sensors using the approach in GD 
as follows (Figure pT|): 

Vertical deflection devices (e.g., speed bumps, humps, 
cushions, and speed tables): As the vehicle hits such devices, 
large spikes in variance in the Y-axis and Z-axis gravity 
acceleration are sensed compared to the other classes. 

Cat’s eyes: Unlike other road anomalies, the cat’s eyes 
structure does not cause the car moving above them to have 
high variance in the Y or Z-axis gravity acceleration. 

D. Railway Crossing 

Railway crossings leads to a medium variance in the Y- 
axis and Z-axis gravity acceleration over a longer distance 
compared to other road anomalies GD- In addition, they cross 
a railway if available on the map. 
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Fig. 11. Effect of different traffic calming devices on the X, Y, and Z gravity 
acceleration variance in comparison with smooth road and railway crossing. 
Cat’s eyes have the lowest Y and Z variance, bumps have the highest Y and 
Z variances, while the railway crossing has a medium Y and Z variance. 


(Figure [T^, we can leverage the orientation angle sensor to 
identify the roundabouts by the differences between their start 
and end orientation angles (Figure p3]). 


E. Roundabouts and Intersections 


A roundabout is a type of circular junction in which road 
traffic must travel in one direction around a central island. 
While a four-way intersection are typically two perpendicular 
crossing roads (Figure [T^. Roundabouts can be identifled as 
normal crossings by some commercial map services as shown 
in Figure 


Noting that a four-way intersection will only have sharp 
90° turns; while a roundabout will have both turns and curves 


F. Stop Sign and Traffic Light 

Traffic lights and stop signs are often used at road inter¬ 
sections, pedestrian crossings and other locations to control 
competing flows of traffic. Different approaches were proposed 
to identify both traffic regulators. We use the approach pro¬ 
posed in GD to detect the location and timing of both stop- 
signs and traffic lights based on the location traces. From both 
traffic lights and stop signs characteristics, all traffic passing 
by a stop sign should stop or at least slowdown (due to driver 
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Fig. 12. Difference between a roundabout and an intersection in terms of 

car behavior Effect of number of samples on the accuracy of estimating the 

semantic location. 
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Fig. 13. Moving along the curve the car direction (orientation) changes till 
the end of the curve, where the car returns to its original direction. This is 
unlike taking a turn, where direction changes after the end of the curving part 
remains. 


behavior), while the behavior of vehicles approaching a traffic 
light depends on if the light is actually red or green. So 
typically only a fraction of vehicles stops. 

They identify a potential stop sign if at least 80% of the 
traces slow down at the intersection and identify a potential 
traffic light if at least 15% of the traces slow down at the 
intersection. If all or all but one directions belonging to a given 
intersection are marked as potential stops, the intersection is 
identified as stop-sign regulated, and all the ways marked with 
a potential stops become actual stop signs; and an intersection 
is considered regulated by a traffic light if half plus one of the 
incoming directions are marked as potential traffic light. 

V. Evaluation 

We implemented our system on different Android phones, 
including: HTC Nexus One, Samsung Galaxy Nexus, Galaxy 
Tab 10.1, and Samsung Galaxy S Plus. We carried out our 
experiments on different roads in the city of Alexandria, Egypt. 
The total distance for in-vehicle traces is 40km (covering 
an area of about lOkm^) and 15km (covering an area of 
about 1.5km^) for the pedestrian traces. Traces were collected 
by three subjects. The ground truth for the map features 
were marked manually. Eor the location, we used an external 
bluetooth satellite navigation system that uses both the GPS 
and GLONASS systems as ground truth and ignored the low- 
accuracy GPS embedded in the phone. 

Eor the rest of this section, we start by evaluating the 
semantics detection accuracy for the different roads, the loca¬ 
tion accuracy for the discovered map features, and the power 
consumption of Map-F-F as compared to GPS-based approaches 
that focus on updating the road segments only (B), (T§, 0. 



GPS GPS GPS Map++ 
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Fig. 15. Energy footprint of Map++ as compared to systems that use the 
GPS (^, (^, with different duty cycles. 


A. Road Features Detection Accuracy 

Tables and |IJ show the confusion matrices for detecting 
the different map semantics from in-vehicle and pedestrian 
traces, respectively. The tables show that different map features 
could be detected with small false positive and negative rates 
due to their unique signatures; we can detect the map semantics 
accurately with 3% false positive rate and 6% false negative 
rate from in-vehicle traces, and 2% false positive rate and 3% 
false negative rate from pedestrian traces. 

B. Discovered Semantic Road Features Location Accuracy 

Eigure shows that the errors in the location of the dis¬ 
covered map feature drop quickly as we increase the number of 
crowd-sensed samples. We can consistently reach an accuracy 
of less than 2 m using as few as 15 samples for all discovered 
map features. 

C. Power Consumption 

Eigure [T^ shows the power consumption of Map-f-i-, which 
is based on the inertial sensors for both road semantic detection 
and localization based on Dejavu, as compared to systems that 
detects the missing road segments only, based on the GPS 
traces (B), P^ , p4| with different duty-cycles. The power is 
calculated using the PowerTutor profiler and the android 
APIs using the HTC Nexus One cell phone. The figure shows 
that Map-F-F has a significantly lower energy profile compared 
to systems that are based on the GPS chip. Note that since 
inertial sensors are used during the normal phone operation, 
to detect the phone orientation change, Map-F-F practically 

















































TABLE L 


Confusion matrix for classifying different road semantics discovered erom in-vehicle traces. 
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TABLE IL CONEUSION MATRIX EOR CLASSIEYING DIEEERENT ROAD SEMANTICS DISCOVERED EROM PEDESTRIAN TRACES. 
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TABLE III. Summary eor state-oe-the-art digital map update techniques. 


Criteria 

Technique^^'''^''-^'--^,,^ 

Sensors Used 

Features Added 

Consumed Power 

CrowdAtlas |34j 

GPS traces 

New roads, road direction, and road type. 

GPS power 

SmartRoad 

GPS traces 

Unregulated intersections, intersections with traffic lights, and 

intersections with stop signs. 

GPS power 

Carisi et al. |17) 

GPS traces 

Traffic lights and stop signs. 

GPS power 

Map-F-F 

Inertial Sensors, 
GSM, Dejavu 

localization 

llinnels, bridges, traffic calming, railway crossing, stop signs, 
underpasses, footbridges, stairs, bridge height, crosswalk, 
roundabouts, and number of lanes. 

Inertial sensors 
power 


consumes zero extra sensing power in addition to the standard 
phone operation. 


VI. Related Work 


Table compares Map++ to the state-of-the-art digital 
map update techniques. 


For indoor maps, recently in |T4| , p5| authors pro¬ 
posed indoor maps construction and inferring indoor structures 
like elevators using sensors available on smart-phones. Map-F-F 
uses a similar approach for semantics inference. However, 
outdoor maps have completely different diverse semantics. 

B. Road Semantics 


A. Digital Map Update 

Digital maps contain inaccuracies that may limit their value 
to different applications. Automatic map update is a promising 
solution for these inaccuracies. Recent work introduced map 
inference using user trajectories for indoor and outdoor digital 
maps. For outdoor maps, raw GPS traces were used to update 
the maps. For example, in CrowdAtlas p4| , authors used 
a map matching approach to infer missing roads on OSM 
using raw GPS traces. They also identified turns locations 
on the map. Similarly, in GI). authors proposed a system 
to extend digital maps with the location and timing of stop- 
signs and traffic lights in a city, using GPS traces collected by 
vehicles. Map-F-F complements these systems and uses a larger 
set of sensors, allowing it to detect significantly more semantic 
features. In addition, the sensors it uses for features detection 
and localization have a significantly lower-energy profile than 
GPS. 


Recently, inertial sensors embedded in smart phones al¬ 
lowed detection of the different road features. In I fni , we 
showed that a finite state machine could be applied to inertial 
sensors to recognize different physical and logical landmarks 
(e.g. bridges, turns, and cellular signal anomalies) for in- 
vehicle mobile phones. The goal was to provide an accurate 
and energy-efficient GPS replacement. Map-F-F extends this 
work by adding more semantic features, such as roundabouts, 
as well as using a classifier-based approach, which provides the 
same accuracy in a more intuitive way with simpler implemen¬ 
tation and more compact representation. In addition, Map-F-F 
enriches the road semantic by a novel class of pedestrian- 
based semantic features such as underpasses, footbridges, road 
capacity among others. 

Monitoring road condition using inertial sensors was pro¬ 
posed in pm , p9| - They mainly use the inertial sensors to 
detect the potholes and traffic conditions and use GPS to 










































































localize the sensed road problems. Both systems use external 
sensor chips which have higher sampling rates and lower noise 
compared to chips on typical cell-phones in the market. In 
addition, they depend on the energy-hungry GPS. Map-F-F, on 
the other hand, detects a significantly richer set of features, 
both based on vehicle and pedestrian traces, using a lower 
energy-profile sensors available in commodity cell-phones. 

VII. Conclusion 

In this paper we presented Map-F-F: a system for auto¬ 
matically enriching digital maps via a crowdsensing approach 
based on standard cell phones. For energy efficiency, Map-F-F 
uses only low-energy sensors and sensors that are already run¬ 
ning for other purposes. We presented the Map-F-F architecture 
as well as the features and classifiers that can accurately detect 
the different road features such as tunnels, bridges, crosswalks, 
stairs, and footbridges from the user traces. 

We implemented our system using commodity mobile 
phones running the Android operating system and evaluated 
it in the city of Alexandria, Egypt. Our results show that 
we can detect the map semantics accurately with 3% false 
positive rate and 6% false negative for in-vehicle traces and 
2% false positive rate and 3% false negative for pedestrian 
traces. In addition, Map-F-F has a significantly lower energy 
profile compared to systems that are based on GPS. 

Currently, we are expanding the system in multiple di¬ 
rections including inferring higher level semantic information, 
such as different point of interests (POIs), using more available 
sensors and optimizing their power consumption, inferring 
more road features, among others. 
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